Improving Search and Exploration in Tag Spaces Using Automated Tag Clustering
نویسندگان
چکیده
In recent years we have experienced an increase in the usage of tags to describe resources. However, the free nature of tagging presents some challenges regarding the search and exploration of tag spaces. In order to deal with these challenges we propose the Semantic Tag Clustering Search (STCS) framework. The framework first groups syntactic variations using several measures based on the Levenshtein distance and the cosine similarity based on tag co-occurrences. We find that a measure that combines the newly introduced variable cost Levenshtein similarity measure with the cosine similarity significantly outperforms the other methods we evaluated in terms of precision. After grouping syntactic variations, the framework clusters semantically related tags using the cosine similarity based on tag co-occurrences. We compare the STCS framework to a state-of-the-art clustering technique and find that the STCS framework performs significantly better in terms of precision. For the evaluation we used a large data set gathered from Flickr, which contains all the pictures uploaded in the year 2009.
منابع مشابه
Improving the Exploration of Tag Spaces Using Automated Tag Clustering
Due to the increasing popularity of tagging, it is important to overcome challenges resulting from the free nature of tagging, such as the use of synonyms, homonyms, syntactic variations, etc. The Semantic Tag Clustering Search (STCS) framework deals with these challenges by detecting syntactic variations of tags and by clustering semantically related tags. We evaluate our framework using Flick...
متن کاملAutomated Tag Clustering: Improving search and exploration in the tag space
In this paper we discuss the use of clustering techniques to enhance the user experience and thus the success of collaborative tagging services. We show that clustering techniques can improve the user experience of current tagging services. We first describe current limitations of tagging services, second, we give an overview of existing approaches. We then describe the algorithms we used for t...
متن کاملA Cluster-Based Approach for Search and Exploration of Tag Spaces
Although Semantic Web technology is increasingly becoming more and more important, tagging remains a popular method to describe Web resources. Therefore it is important to address the issues that are found in current tagging search engines, such as Flickr. We find that the free nature of tagging results in many issues for tag search engines, such as synonyms, homonyms, syntactic variations, etc...
متن کاملA semantic-based approach for searching and browsing tag spaces
In this thesis we propose the Semantic Tag Clustering Search framework (STCS). This framework consists of three parts. The first part deals with syntactic variations by clustering tags that are syntactic variations of each other and assigning a label to them. The second part of the framework addresses the problem of recognizing homonyms and identifying semantically related tags. The last, and f...
متن کاملA Comparative Study of Hierarchical Clustering Algorithms for Tagging Systems
With the rapid growth of information on the web, the so-called web2.0 services provide users with a simple way of managing a collection of resources. The collaborative nature of social bookmarking systems allows users to annotate their resources easily and explore other people resources in the network. However, data exploration in such large and complex networks is not always easy, due to lack ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Web Eng.
دوره 13 شماره
صفحات -
تاریخ انتشار 2014